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Gravitational redshifts from large-scale structure 
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ABSTRACT 

The recent measurement of the gravitational redshifts of galaxies in galaxy clusters 
by Wojtak et al. has opened a new observational window on dark matter and modi- 
fied gravity. By stacking clusters this determination effectively used the line of sight 
distortion of the cross-correlation function of massive galaxies and lower mass galax- 
ies to estimate the gravitational redshift profile of clusters out to 4 h~ 1 Mpc Here 
we use a halo model of clustering to predict the distortion due to gravitational red- 
shifts of the cross-correlation function on scales from 1 ~ 100 /i _1 Mpc. We compare 
our predictions to simulations and use the simulations to make mock catalogues rel- 
evant to current and future galaxy redshift surveys. Without formulating an optimal 
estimator, we find that the full BOSS survey should be able to detect gravitational 
redshifts from large-scale structure at the ~ 4tr level. Upcoming redshift surveys will 
greatly increase the number of galaxies useable in such studies and the BigBOSS and 
Euclid experiments should be capable of measurements with precision at the few per- 
cent level. As has been recently pointed out by McDonald, Kaiser and Zhao et al, 
other interesting effects including rclativistic beaming and transverse Dopplcr shift 
can add additional asymmetric distortions to the correlation function. While these 
contributions are subdominant to the gravitational redshift on large scales, they rep- 
resent additional opportunities to probe gravitational physics and indicate that many 
qualitatively new measurements should soon be possible using large redshift surveys. 
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1 INTRODUCTION 

In the weak field limit, the gravitational redshift, z g of pho- 
tons with wavelength A emitted in a gravitational potential 
(f> and observed at infinity is given by z g — -£■ ~ —$■. Mea- 
surement of z g is one of the fundamental tests of General 
Relativity (GR). First measured more that 50 years ago for 
the Earth's gravity in a laboratory setting (Pound & Rebka 
1959), subsequent determinations have been made in the so- 
lar system (Lopresto et al. 1991) and from spectral line shifts 
in red giant stars (e.g., Greenstein et al. 1971). In this paper 
we will examine how well the gravitational redshifts caused 
by the largest potential fluctuations in the Universe can be 
measured using galaxy redshift surveys. 

Predictions for the gravitational redshifts of galaxies 
in clusters were computed using analytic models by Cappi 
(1995, see also Nottale 1990). Cappi found that in the most 
massive ( ^ 10 /i _1 Mq) clusters the central galaxy is ex- 
pected to have a redshift of a few tens of km s _1 with re- 
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spect to other cluster members. Kim & Croft (2004, here- 
after KC04) showed that instead of using single extremely 
massive clusters, large galaxy surveys could be used to make 
a statistical measurement of the gravitational redshift pro- 
file. McDonald (2009, hereafter M09) examined the issue in 
Fourier space and pertubation theory, studying the effect of 
gravitational redshifts on the large-scale cross-power spec- 
trum of different populations of galaxies. 

The first observational determination of galaxy grav- 
itational redshifts due to their large-scale environment 
was made by Wojtak et al. (2011, hereafter Wll) using 
galaxy redshift data from the Sloan Digital Sky Survey 
(SDSS). Wll used 125000 galaxies in 7800 galaxy clusters 
to make statistical measurement of z g vs. brightest clus- 
ter galaxy distance out to a radius of 4 /i _1 Mpc (h = 
-Ho/100 km s _1 Mpc -1 ). Their 2.6a measurement was com- 
pared to modelling of the mass distribution from galaxy ve- 
locity dispersions and used to put constraints on modified 
gravity models. Dominguez-Romero (2012) also used SDSS 
data to carry out such a test and also found good agreement 
with GR. 
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The redshift of a galaxy is a sum of 3 components, the 
Hubble redshift, the Doppler redshift from the line of sight 
peculiar velocity and the gravitational redshift: 



H(z)r- 



+ cz„ 



(1) 



where H(z) is the Hubble parameter and r the radial dis- 
tance between the observer and galaxy in Mpc. The autocor- 
relation function of galaxies is often measured as a function 
of separation between pairs of galaxies along the line of sight 
(r\\) and across it (r±). The v pcc and cz g terms in Equation 
[l] distort the correlation function of galaxies in the ni di- 
rection. For the autocorrelation this distortion is symmetric 
about A(?-||) = 0. However, more highly clustered galaxies 
(in practice also more massive) lie in deeper potential wells 
and so have a higher cz g on average than less clustered (and 
less massive) galaxies. This means that the cross-correlation 
function of two subsets of galaxies with different clustering 
strengths will be asymmetric about the r± axis. If the cross- 
correlation function is centered on the most massive galaxy 
the surrounding galaxies will have an relative blueshift. This 
blueshift can be measured as a function of galaxy pair sep- 
aration and therefore provide a measure of the scale de- 
pendence of the large scale gravitational potential around 
galaxies. 

The cross-correlation can be carried out between any 
subsets of galaxies. In KC04 and Wll the higher mass subset 
were brightest cluster galaxies (BCGs) and the lower mass 
subset were other cluster members and nearby field galax- 
ies. M09 considered two general populations of galaxies with 
different bias parameters and showed that in Fourier space 
the gravitational redshift leads to an imaginary term in the 
cross-spectrum. The analysis of M09 was carried out in lin- 
ear theory, but also included other terms in the distortion of 
clustering (related to the transformation from observable to 
proper coordinates, see Yoo et al. 2009,2011). Our current 
study will include non-linear scales and make use of N-body 
simulations to formulate and test the simplest measurement 
scheme. Our aim is to be able to unify the two regimes that 
were considered previously, virialized objects and large scale 
linear structure and formulate a model and analysis method 
applicable to all scales. 

Because the predicted gravitational redshift of galaxies 
is typically much smaller than the other terms in Equation 
[l] there are several other effects which need to be consid- 
ered when devising a scheme to measure z g from galaxy 
clustering. Most of these effects have been considered in the 
context of galaxy clusters and are most important in viri- 
alized regions. For example, Zhao et al. (2012) have shown 
that that because the galaxies around the cluster center are 
not at rest, special relativistic time dilation leads to an addi- 
tional wavelength shift between those and the center (BCG 
in the case of a cluster). Zhao et al. term this the Transverse 
Doppler effect and point out that it will be opposite in sign 
to z g . Kaiser (2013, hereafter K13) recently explored several 
more effects, including special relativistic beaming which 
will cause infalling galaxies behind a cluster to be brighter. 
This will boost the number above a magnitude limit behind 
a cluster compared to in front and give an asymmetry of 
similar order to the z g effect. K13 (see also KC04) also con- 
sidered the cluster in the context of an infalling density field 
that joins the Hubble flow at large radii. This and the effects 
explored in M09 can in principle be modelled and marginal- 



ized over when looking the effect of gravitational redshifts, 
so that we leave a detailed study of their properties to fu- 
ture work. We give some comments on their likely relative 
magnitude and other potential effects in Section \5. 21 but in 
the present paper we concentrate on predictions for z g . 

Our our plan for the paper is as follows. In Section 2 
we use halo model-inspired fits to the galaxy-mass cross- 
correlation to predict the mean relative z g between pairs of 
galaxies in two distinct populations. We also compute how 
this z g will act together peculiar velocities to distort the 
cross-correlation function of galaxies from the two popula- 
tions, focussing on the residual asymmetry about the r± 
axis. We test our model with simulations in Section 3. In 
Section 4 we make simple mock catalogues relevant to cur- 
rent and future redshift surveys to estimate error bars which 
could be expected with observational data. In Section 5 we 
summarise our results and comment on other sources of 
asymmetric distortion in galaxy clustering. 



2 LARGE-SCALE CLUSTERING AND 
GRAVITATIONAL REDSHIFTS 

A way to measure statistical information on the large-scale 
gravitational potential is to examine how a galaxy corre- 
lation function is distorted by gravitational redshifts. The 
theoretical prediction is effectively the same calculation as 
carried out for the cluster redshift profile by Nottale (1990) 
and later authors, but on larger scales. The auto-correlation 
function has a pairwise symmetry even with z g included, 
but this is broken for the cross-correlation function of two 
populations of galaxies with different masses (or bias param- 
eters). We then need to calculate, given the higher mass (or 
bias) galaxy at the origin, what the relative blueshift is for 
the lower mass galaxies clustered around it. 

We take a set of galaxies and split them into two sub- 
sets by mass, labelling the higher mass subset gl and the 
lower mass subset g2. After averaging over pairs of gl and 
g2 galaxies, the mean gravitational redshift difference be- 
tween g2 galaxies and gl galaxies separated by distance r 
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where Mi2(r) is the mass given by 



(2) 



M 12 {r) = Airp / (^ g i P (x) - ^ g2p (x))x 2 dx. (3) 

Jo 

Here £ 9 i p is the gl galaxy-mass cross-correlation function, 
£ S 2p is the g2 galaxy-mass cross-correlation function, p is the 
mean density of the Universe, c is the speed of light and G 
is Newton's gravitational constant. 

Equation [2] is the prediction for the gravitational red- 
shift in General Relativity. In order to give a simple example 
of what some alternative theories predict for this quantity, 
we will also make use of predictions for the f{R) gravity 
model used by Wll, derived from one of the models sim- 
ulated by Schmidt (2010). For this, model it was found by 
Schmidt (2010) that z g (r) in virialized regions should be 
multiplied by a factor of 1.33. We note that f(R) gravity the- 
ories can give various values for the difference between the 
dynamical and lensing potential (Schmidt 2010) and that 
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our use of this result is primarily illustrative of the rough 
size of possible effects from non-GR models. Also, while we 
will plot this f(R) curve on larger than intracluster scales it 
has not been shown to be relevant there and in this case acts 
to guide the eye, illustrating the effect of a 33% difference 
from GR. 

For £ s i p and £ g 2 P we use a fit inspired by the fact that 
clustering of galaxies and dark matter can be described by 
terms which are a combination of internal halo structure and 
large-scale, linear clustering (the halo model see e.g., Sheth 
and Cooray 2002 for a review). We use 



£ 9P (r) = po 



(1 



R. 



■Y 



+ b£cDM(r) 



(4) 



where we assume that the internal galaxy halo structure fol- 
lows the Navarro et al. (1997) relation (po and R s are free 
parameters) and £cdm is the linear theory Cold Dark Mat- 
ter correlation function (computed using Lewis & Challinor, 
2011). Here 6 is a linear bias parameter relating galaxy and 
matter clustering on large scales and is a monotonically in- 
creasing function of galaxy mass (see e.g. Mo and White 
1996). 

Having computed z g (r) we use it to distort the cross- 
correlation function of gl and <?2 galaxies in redshift space, 
£gig2(s±,S||). Here s± and sii are respectively the distance 
between pairs of gl and g2 galaxies perpendicular and par- 
allel to the line of sight, and s\\ can be either positive or 
negative. 

There is also the additional distortion of £gig2{s±,S\\) 
due to peculiar velocities. As in Croft et al. (1999), we model 
this using a spherical infall model for large scale flows (taken 
from Yahil 1985) and a small scale random velocity disper- 
sion (e.g., Davis and Peebles 1983). The infall model is 



Vinfall(»") 



n 0.6 rr 

S2 H r- r 



6(r) 



(5) 



where S(r) is the matter overdensity averaged within radius 
r of gl galaxies: 



*(»•) = 3 



— / iglp{x)x 2 dx 
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(6) 



Croft et al. (1999) found that this velocity field model gives a 
better match than linear theory for the infall pattern around 
galaxy clusters. Because here we are interested in the red- 
shift distortions around massive galaxy halos, we expect the 
model to work well in the current context also. This is borne 
out in our tests with numerical simulations (SectionfS]). Be- 
cause Uinfaii(r) is not expected to describe the virialized re- 
gions of clusters, we follow Croft et al. (1999) and truncate 
VinfaiiM on small scales by multiplying by an exponential, 
exp-(<5/50). 

The random velocity dispersion we use is an exponential 
model, so that the distribution function of velocities is 
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where o\2 is the pairwise velocity dispersion of gl — g2 pairs 
of galaxies, which we assume to be independent of pair sep- 
aration. Based on simulation results we take this value to 
cm = 400kms _1 . 

After applying the infall model, gravitational red- 
shift and convolving with f(v), the redshift space cross- 
correlation function is therefore: 



/oo 
dvf(v)£gi g2 ,i S o {rx,r\\ , & s 

- Hg (cz g (r) - r||r _1 u infal i(r) - v)) , 

(valid at redshift 2 = 0) where £ g i g 2, iso is the isotropic (real 
space) cross-correlation function of gl and gl galaxies, which 
we also model using Equation [3] 

To make theoretical predictions for £ g i 9 2 we fit the free 
parameters in Equations [4] and [7] (b, po, cm) using sim- 
ulations (see Section |3}. In Figure Q] we show contours of 
£ g i g 2(r±,r II )> illustrating the effects of the different redshift 
distortions. We use parameters in Equation [4] which are ap- 
propriate for halos with mass > 3 x 10 13 /i _1 Mq, where the 
two populations of galaxies gl and g2 are the high and mass 
low halves of the set of halos. 

Because the gravitational redshift is so small, for illus- 
trative purposes we have multiplied z g by a factor of 500 
when making Figure [l] This should be borne in mind when 
assessing the relative effects shown. The top left panel of 
Figure [T] shows the undistorted, isotropic correlation func- 
tion (Equation [1). In panel (b) we can see that the effect of 
gravitational redshifts without peculiar velocities is to shift 
the contours of £ 9 i g 2 downwards, corresponding to a relative 
blue shift for the g2 galaxies clustered around the gl galax- 
ies. In panel (c) the peculiar velocity distortion has been 
applied on its own, resulting in a distortion of £ s i g 2 which 
has reflection symmetry about the r± axis. The large scale 
squashing of the contours can be seen (the Kaiser (1987) 
effect) as well as the small scale elongation of £ g i S 2 due to 
the random velocities. 

In panel (d) we show gravitational and peculiar velocity 
redshift distortions together. We can see that as the effect 
of Kaiser (1987) infall is to boost the correlation function 
overall, this will also enhance the strength of the asymmetric 
signal due to gravitational redshifts. This illustrates that 
both peculiar velocity and gravitational redshift distortions 
will need to be modelled together in order to make precise 
constraints on cosmological theories using z g (r). 

Given a set of gl-g2 galaxy pair separation measure- 
ments, one now needs to formulate an estimator of the asym- 
metry of clustering which can probe z g (r). In the galaxy 
cluster case (KC04, Wll), the pair separations were binned 
into cylindrical shell bins, which was appropriate because 
the clusters were being treated as distinct objects. In the 
current large-scale structure case, we have decided to in- 
stead bin the pairs in spherical shell bins, and our statistic 
sensitive to z g (r) is the mean ry position of the pair- weighted 
centroid of each shell: 



V) = 



/;, +Ar tfr ll [l+£(rx ) r ll )]r 2 dr 
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[1 + £(r_i_,r||)]r 2 dr 



(9) 



This estimator z g ° (r) will tend to zero at large and 
small scales. On small scales this is because the Zg hell (r) 
shift cannot be larger than the spherical bin radius. On large 
scales this is because the clustering tends towards homogene- 
ity and it is not possible to detect a blueshift or redshift of 
a homogeneous set of particles. We will explore the exact 
shape of z g hell (r) in Section[3]below. We note that other mea- 
sures of the asymmetric distortion of the correlation function 
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Figure 1. Redshift distortions of the cross-correlation function of two populations of galaxies (high and low mass), using a halo model 
of galaxy clustering (Section [2} . We show the effect of peculiar velocities and gravitational redshifts separately (top right and bottom 
left panel respectively) and together (bottom right). The magnitude of the gravitational redshift component has been magnified by a 
factor of 500 in order to make its effect easily visible. 



could be chosen. For example one could bin galaxy pairs not 
in spherical bins but in bins matched to the expected shape 
of contours of £gig2{r±,r\\). Or else one could imagine car- 
rying out a full fit to the observed £ g i S 2(r_L, r\\ data varying 
parameters in our theoretical model. In the present paper 
we restrict ourselves to the simple estimator of distortion in 
Equation [9] and leave exploration of possibly more sensitive 
measures of z g (r) to future work. 



3 SIMULATION TESTS 

We now compare the results of the theoretical predictions 
for the z g distortion of Section [5] to results from numeri- 
cal simulations. It should be borne in mind that both the 
galaxy-mass cross-correlation function and the gl-g2 galaxy 
cross-correlation function enter into the predictions for the 
observable quantity (Equation[9]) and so to make predictions 
we will make some simplifying assumptions about galaxy 



formation. Here we will associate galaxies with dark matter 
subhalos and assume that applying a dark matter mass cut 
to a subhalo population is equivalent to applying a luminos- 
ity cut to galaxies. 



3.1 Simulations and galaxies 

We have used the P-Gadget (see Springel 2001,2005, Khandai 
et al. 2011) N-body code to run 10 realizations of a ACDM 
universe. The cosmological parameters used were : ampli- 
tude of mass fluctuations, erg = 0.8, spectra index, n 3 = 0.96, 
cosmological constant parameter Q,a = 0.74, and mass den- 
sity parameter Q m = 0.26. The cubic periodic box side 
length was 1 h~ Gpc, and the number of particles 768 3 per 
realization, leading to a particle mass of 1.5 x 10 11 /i _1 Mq. 
The gravitational force resolution was 20/i _1 kpc, so that 
subhalo structure is not well resolved (we return to this be- 
low). 
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The bound structure finder SUBFIND (Springel 2001) 
was used to find galaxy sized subhalos. Because the cur- 
rent relevant redshift surveys (e.g., BOSS, Ahn et al. 2012) 
are focussed on massive galaxies, we make two different 
subhalo samples, one where subhalos are above a mass 
limit of 10 13 h _1 Mq (64 particles) and the other above 
3 x 10 13 /i _1 M . 

We compute the gravitational redshift z g of the central 
particle in each subhalo, and assign that to be the z g as- 
sociated with the galaxy inside the subhalo. We have also 
tried averaging the gravitational redshifts of all particles in- 
side each subhalo instead. We find that while this makes a 
small difference to the individual z g values it does lead to 
problems with the pairwise differences between satellite and 
central galaxies. This is because while the central part of 
a central subhalo may be in a deeper potential well than 
nearby satellites, averaging particle z g values over the en- 
tirety of the dark matter subhalo decreases the redshift and 
can often lead to a blueshift with respect to nearby small 
satellites. This would not happen observationally as the stel- 
lar parts of galaxies are more centrally concentrated than the 
dark matter. We therefore account for this by using only the 
central particle to assign z g to the galaxy. 

As the gravitational potential is very smooth, the dis- 
tribution of gravitational redshifts of galaxies will be also. It 
is instructive to plot a slice through the simulation to see the 
nature of the z g fluctuations which we will be characterising. 
We have done this in Figure[2] where we show z g on a colour 
scale with galaxy mass denoted by symbol size. We can see 
that the most massive galaxies are the most clustered, as 
expected, and also that the galaxies in the deepest poten- 
tial well (visible near the center of the slice) are also the 
most massive. The length scale of the visible z g fluctuations 
is extremely large, with the main potential well covering 
most of the /i -1 Gpc volume. We will be seeking to measure 
this structure by measuring z g differences between pairs of 
galaxies on scales up to 100 h~ 1 Mpc. On average the most 
massive galaxies will tend to be more clustered and have 
more positive redshifts (red in Figure [5]) and this is what we 
will measure through the effect of this small shift relative to 
the nearby lower mass galaxies. 

After making a galaxy catalogue from a simulation with 
positions, peculiar velocities and gravitational redshifts de- 
fined as above, we apply a mass cut to split it into two halves 
gl and gl containing an equal number of galaxies. For the 
M > 3 x 1O 13 /i _1 M sample, for which there are an average 
of 10 galaxies in each cubic h~ 1 Gpc, the mass thresh- 
old which does this is 5.1 x 10 li h~ 1 MQ. In this case, the 
mean mass of the low mass half (gl) is 3.8 x 1O 13 /i _1 M 
and of the high mass half (g2) 1.2 x 10 14 /i _1 M o . For 
the M > 3 x 1O 13 /i _1 M sample a mass threshold of 
1.8 x 1O 13 /i _1 M split the sample into two halves, of mean 
mass 1.3 x 1O 13 /i _1 M and 5.6 x 1O 13 /i _1 M . We note that 
defining the gl and g2 galaxies to be equal halves of a sam- 
ple may not be the optimal split to get a measurement of 
gravitaional redshift distortions. For example, in the galaxy 
cluster (e.g., Wll) case the gl galaxies were all the non-BCG 
galaxies and so comprised most of the sample. 
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Figure 2. Gravitational redshifts of dark matter subhalos in 
a simulation. We show a 10 h _1 Mpc thick slice through a 
1 h Gpc N-body simulation cube at redshift 2 = 0. Dark matter 
subhalos are shown with symbol size proportional to (mass) 1 ' 3 
and color that indicates their gravitational redshift (positive num- 
bers for redshift). In this case the redshift is not relative to infinity 
but instead relative to the mean level in the Universe. 



3.2 Pairwise z g (r) from simulations 

In the simulations, the gravitational redshift of each galaxy 
is available, so that one can directly compute the mean z g 
difference between pairs of gl and <?2 galaxies as a function 
of pair separation. While this would not be available obser- 
vationally, it is a useful first test of the prediction for z g from 
Equation [2] For all pairs of gl and gl galaxies in the simula- 
tion we compute the redshift difference Az g = z 9 , 9 2 — z Bl gi 
and bin the results as a function of pair separation. We show 
the results in Figure [3] as points, with error bars derived 
from the standard deviation among the 10 simulation re- 
alizations. We also show the predictions made using Equa- 
tion [2] as a smooth line. On small scales r £ 0.5 7i~ Mpc, 
the theoretical predictions are relatively sensitive to the in- 
ternal structure of subhalos, in a regime which is not well 
resolved in the simulations. To deal with this, when integrat- 
ing Equation [4] we introduce a free parameter r cut so that 
Csp( r ') = ?9p( r 'cut) for r < r cut , and we adjuste r cut so that 
the simulation and theoretical results in Figure [3] match at 
the small scale end. 

As expected, the g2 galaxies have a relative blueshift, 
and this increases as a function of distance from the gl 
galaxy, reaching ~ 4 km s _1 at 100 /i _1 Mpc in panel (a) 
(galaxies above mass 3 x lO 13 /i _1 M and ~ 3 km s _1 for 
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Figure 3. Mean averaged gravitational rcdshift difference Az g 
between pairs of dark matter subhalos as a function of separation. 
In each panel, the lower limit for the mass of subhalos in the 
sample is given. For each panel, the sample was split into two by 
mass (high and low mass halves) and Az g was computed from 
mean difference in gravitational redshifts for galaxies in the two 
samples as a function of pair separation. Results for simulations 
are shown by points with error bars. The solid curves show results 
computed by integrating a simple analytic fit to the subhalo-mass 
cross-correlation function (see Equation [4j 



Equation [2] deals with averaged mass and gravita- 
tional redshift profile around galaxies. There will however 
be galaxy to galaxy variation of this quantity, and this will 
lead to a dispersion in gravitational redshifts as a function 
of scale. Because this dispersion will be symmetric about 
j"H = 0, in observational data it will be completely hidden 
by the much larger symmetric signal due to the dispersion 
in peculiar velocities and so unobservable. It is nevertheless 
instructive to measure the dispersion about the mean Az g 
profile in simulations. One reason for this is that in Section[6] 
we will make mock catalogues where model the signal from 
a much larger number of galaxies by boosting z g . This will 
also amplify the contribution of the dispersion in Az g so 
that we need to understand its magnitude. 

The results for the dispersion about Az g (r), a(Az g (r)) 
are shown in Figure [4] We have also plotted the theory 
curves for Az g (r) from Figure[3](we have no theoretical pre- 
diction for a(Az g (r)) to guide the eye. We can see that the 
dispersion about the mean profile is approximately equal to 
the mean profile until about ~ 2 — 5 /i _1 Mpc and after this it 
rises relatively steeply until it is of the order of ~ 10 km s _1 
at r = 100 /i _1 Mpc. It is still much smaller than the random 
redshift dispersion from peculiar velocities, however (which 
is ~400kms _1 ). 
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Figure 4. a(Az g ), the dispersion about mean averaged gravita- 
tional redshift difference Az g plotted in Figure [3] We again give 
in each panel the lower limit for the mass of halos in the sample. 
Results for simulations arc shown by points with error bars. The 
solid curves not a theoretical prediction for this quantity, but in- 
stead are the theory curves for z g from Figure[3]to aid comparison 
of the two plots. 



panel (b), the lower mass galaxy catalogue. There is a knee 
in the Az g curves at r ~ 2 — 3 /i _1 Mpc, related to the tran- 
sition in £ 9P between the regime internal to subhalos and 
large scale linear cross-correlation function. The theoretical 
predictions broadly follow the simulation results, although 
there are kinks in the simulation trend at smaller scales. Fu- 
ture work with higher resolution simulations will be needed 
to investigate how well the halo model prediction can be 
made to fit on these scales. In the present case, we will see in 
Section [3]3] that some information on these scales is washed 
out by peculiar velocity distortions in any case. 



3.3 Gravitational redshift estimator 

Having seen that the intrinsic pairwise gravitational redshift 
differences between galaxies in simulations can be modelled, 
we now turn to the estimator Zg hcll (r) of Equation [5] which 
can be applied to observational data. Applying the estimator 
to simulation data in a periodic volume with a uniform se- 
lection function is equivalent to finding the centroid of shells 
containing pairs of galaxies at different separations. In ob- 
servational data a random catalogue would need to be used 
to account for the uniform survey coverage, volume of bins 
and so on. 

We would like to show a comparison between simulation 
and theory for the z g hell (r) statistic. One issue is that the 
z g component of the redshift is extremely small, compared 
to the other redshift components, so that averaging over an 
extremely large number of simulated galaxies is needed to 
give a comparison with good signal to noise. Rather than 
running hundreds more simulations, we instead choose to 
boost the contribution of the z g component by multiplying 
it by a large factor (as was done to make Figure [TJ. When 
computing the theoretical prediction from Equation [8] we 
can also multiply z g by the same factor (we use a factor of 
100 here) and therefore make a consistent comparison. We 
then divide the final results by this factor in order to plot the 
z g curves with the amplitude they would have had without 
boosting them (therefore only the relative contribution of 
the noise has been changed). 

In Figure \5\ we show Zg hell (r) for the simulations as 
points and theoretical predictions as lines, for both galaxy 
catalogues. We show the results separately without and with 
peculiar velocities (top and bottom panels respectively). 
When showing the results without peculiar velocities, the 
simulations include the dispersion about the mean z g profile 
plotted in Figure U whereas the theory does not model this. 
Because boosting the z g values in the simulation by a large 
factor to make the signal to noise low also boosts the disper- 
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Figure 5. The gravitational rcdshift estimator z g (r) (see Equation[9]| plotted as a function of shell radius r. We show results measured 
from simulations as points (with error bars derived from the scatter between 10 realizations). The solid lines show the halo model inspired 
predictions of Equations [2] and [9] for both general relativity (black line) and the f(R) model of Section [2] The top panels show results 
with no peculiar velocities added to galaxy redshifts and the bottom panels show the results including all terms in Equation [l] and 
represent the observable quantity. The left and right panels show results for samples of subhalos with different lower mass limits (given 
in the legend). 



sion a(A(z g ), this means that comparison with the theory 
curves is not possible on small scales. We can see that the 
theory curves track the large scale simulation plots that are 
plotted well. 

The shape of the curves in Figure [S] shows the blueshift 
of galaxies surrounding more massive ones, which produces 
a larger and larger asymmetry in the centroid shift of shells 
until between 6-9 h~ Mpc when the signal decreases. The 
exact point where this happens will depend on the shape 
of the galaxy-mass cross-correlation function. As the large 
scale galaxy density field becomes more uniform on large- 
scales, this shell centroid blue shift becomes smaller. On 
largest-scales (r > 50 — 60 h~ 1 Mpc), the shell centroid actu- 
ally becomes slightly redshifted with respected to the central 
galaxy. 

In the bottom panel of Figure [5] we show the results 
in redshift space, where we can see that because the pe- 
culiar velocities are dominant in smoothing out the correla- 
tion function on small scales that the theory and simulations 
agree reasonably well there also. This is obviously the situ- 
ation which will apply when dealing with observations, and 
so any remaining differences on small scales between theory 
and simulations will be due to a combination of differences in 
the gravitational redshift profile (Figure [3} and the peculiar 
velocity modelling. 

If we compare the top and bottom panels of Figure 
we can see that the effect of peculiar velocities is to slightly 
increase the signal on scales r > 5 /i _1 Mpc. It can be seen 
that the maximum blueshift of g2 galaxies in shells around 
gl galaxies is around 1 km s~ . This is signficantly less than 



the blueshifts of up to 14 km s _1 seen by Wll, and shows 
that by splitting the galaxies into equal halves (of gl and gl 
galaxies) we make the signal numerically much smaller than 
the redshift profile around rarer galaxies at the centers of 
massive clusters. The number of pairs of galaxies is obviously 
maximized by our choice, but the signal to noise of the z g 
measurement may not be. We leave the question of how best 
to partition a galaxy sample to make cross-correlations and 
also the best z a estimator to use to future work. 



4 MOCK CATALOGUES 

In order to make some approximate predictions for results 
that can be expected by measuring z g hcll (r) from current 
and upcoming large redshift surveys, we have made some 
very simple mock catalogues from our simulations. The three 
galaxy redshift surveys which we deal with are 

(a) the full SDSS/BOSS galaxy survey (Aihara et al. 2011, 
Eisenstein et al. 2011) which will be complete in 2014 and 
will contain 1.5 x 10 6 Luminous Red Galaxies (LRGs) below 
redshift z — 0.7. 

(b) the BigBOSS galaxy redshift survey (Schlegel et al. 2009) 
which will measure redshifts for 15 x 10 6 Emission Line 
Galaxies (ELGs) between redshifts z — 1 and z = 2 and 
3.5 x 10 6 LRGs between redshifts z = 0.2 and z = TO. The 
survey us expected to begin in 2017 and finish in 2022 

(c) the Euclid spectroscopic survey (Laureijs et al. 2011) 
which will use space-based slitless spectroscopy to measure 
redshifts for between 50 - 80 x 10 6 ELGs from z = 0.5 to 
z = 2.0. The expected launch date for Euclid is 2020. 
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Figure 6. The points show z g measured by observing mock catalogs made from simulations. The error bars were computed from the 
standard deviation of the measurement from 10 subsamples in each case. We show predictions for the full BOSS survey, BigBOSS and 
Euclid galaxy redshift surveys. The lines are the predictions of Section [2] which used a halo model fit to the halo-matter cross-correlation 
function. The black curve shows the prediction of General Relativity and the red curve the simple estimate of an f(R) gravity model 
mentioned in Section [2] 



the cross-correlation of two populations of galaxies is lim- 
ited only by shot noise and not by cosmic variance. This 
is the same property which enables peculiar velocity red- 
shift distortions to be measured limited only by shot noise 
(McDonald & Seljak 2009). Therefore the signal to noise 
of a z g measurement in a survey of volume V containing 



iYs, 



nV galaxies, scales as V 1 ' 2 , and also as n 1 ' 2 as 



these quantities are changed (M09). As a result when mak- 
ing predictions, to first order the number of galaxies N SUIvey 
is the important quantity and it is a reasonable approxi- 
mation to scale our simulations by a factor to account for 
the differing shot noise resulting from the different numbers 
of galaxies in each survey. Although we have iV-body simu- 
lations for only 10 (1 h~ 1 Gpc) 3 volumes, with a maximum 
4 x 10 5 galaxies of mass > 1Q 13 Ii~ 1 Mq per volume, this scal- 
ing to account for the number of galaxies allows us to make 
predictions for surveys containing many more galaxies. 

Other simplifications which we make here are to make 
the z— axis of the simulation boxes be the line-of-sight (the 
plane-parallel approximation), rather than modelling any 
actual survey geometrical effects. We also carry out all mea- 
surements at a single redshift z — (in practice there will 
also be a dependence on the mean redshift of the survey e.g., 
the perturbation growth factor at Z — 1 is 0.62 for the stan- 
dard cosmology so perturbations in the gravitational poten- 
tial will be 38% smaller). We use the galaxy catalogue with 
mass limit m>3x lO 13 fe _1 M0 to model surveys (a) and 
(b) and the mass limit m > 10 13 '/i _1 M@ to model survey 
(c). For survey (b) we have also not modelled the ELG and 
LRG populations separately, but only include in our mock 
the 15 x 10 6 ELGs. We also do not include other asymmetric 
distortion terms in the correlation function, such as Trans- 
verse Doppler effect (we discuss them in Section T5. 21 below - ). 
We leave more sophisticated modelling of observational pre- 
dictions to future work. Our aim here is to make a rough 



assessment of the detectability of z g signals from large-scale 
clustering. 

When making mock surveys (a)-(c) we therefore scale 
the gravitational redshift z g for galaxies to account for differ- 
ing shot noise values. The scaling factor which we multiply 

where y/N S i m is the number of galaxies in 



by is s 



y/j& 



\^h 



all realizations of the simulations added together (10 for the 
3 x 10 13 /i _1 M Q mass limit and 4 x 10 6 for the 10 13 /i -1 M Q 



to be 1.5 x 10 b for full BOSS, 
3 x 10 6 for Euclid. 



mass limit). We take N SUIVi 
15 x 10 6 for BigBOSS and 

In Figure [S] we show the results measured from these 
mock catalogues. The error bars show the error on the mean 
computed from the standard deviation of 10 subsamples 
(the 10 simulation realizations). We can see that the rel- 
ative blueshift as a function of scale should be detectable 
in the full BOSS survey. The difference in chi-squared from 
zero is 16, indicating a 4 a detection. We have assumed a 
diagonal covariance matrix however. Future work should use 
a larger number of simulations to be able to compute and 
include the off diagonal terms. 

The BigBOSS and Euclid surveys should also enable 
the shape of the curve as a function of scale to be traced 
out, with information on z g up a few tens of /i _1 Mpc being 
accessible. The overall amplitude of the curves is measurable 
to 6.5% precision for BigBOSS and 4.0% for Euclid. 



5 SUMMARY AND DISCUSSION 

5.1 Summary 

Using the halo model, we have made predictions for the rel- 
ative gravitational redshifts between pairs of galaxies above 
and below a mass threshold as a function of scale. We have 
shown how this relates to distortions in the galaxy cross- 
correlation function. Comparing this to measurements from 
observational data should enable constraints to be made on 
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gravity models on large scales. In our theoretical and simu- 
lation study we have found that 

(1) The large-scale relative gravitational redshifts be- 
tween simulated galaxies can be predicted well from the halo 
model. On small scales, internal to galaxy halos our compar- 
ison is hampered by limited simulation resolution but still 
gives good enough predictions for the observable quantity 
[(2), below]. 

(2) The gravitational redshifts distort the galaxy cross- 
correlation function so that it is asymmetric about the r± 
axis. When this asymmetry is quantified by measuring the 
pair- weighted centroid shift of spherical shells it peaks at a 
pair separation of ~ 6 — 9 h~ 1 Mpc. 

(3) Peculiar velocities smooth out the correlation func- 
tion in redshift space. This has the advantage that after this 
smoothing the small scale clustering which is most difficult 
to predict does not have to be known so accurately. It also 
has the disadvantage that peculiar velocity modelling will 
form a necessary part of accurate interpretation of gravita- 
tional redshifts. 

(4) Current galaxy redshift surveys such as SDSS/BOSS 
should be able to detect large scale gravitational redshifts 
from the distortion of the correlation function. Future larger 
surveys such as BigBOSS and Euclid should be able to make 
measurements of the distortion out to tens of h~ 1 Mpc scales 
with a precision good enough to differentiate between some 
alternatives to General Relativity. 

5.2 Discussion 

The measurement of gravitational redshifts of galaxies, 
starting with Wll has effectively opened a new window on 
the distribution of matter in the Universe and its gravita- 
tional effects, complementary to weak lensing and redshift 
distortion measurements, but with many different dependen- 
cies on survey geometry, model parameters and underlying 
physics. 

Although larger samples of galaxies are required to 
reach results, gravitational redshifts have several potential 
advantages over competing probes such as gravitational lens- 
ing or peculiar velocities (see also M09). For example, lensing 
effectively gives a constraint on the projected mass density 
(the lensing kernel is broad in redshift), whereas gravita- 
tional redshifts are localized in 3D space. Redshifts also give 
a constraint on the potential at one instant in time, rather 
than an integral over time which is the case for redshift dis- 
tortions from peculiar velocities. 

The measurement of z g from large-scale structure and 
also the combination of those results with measurements 
from galaxy clusters will enable constraints to be put on 
cosmological models (for example the amplitude of the sig- 
nal in Figure [6] will scale with the amplitude of mass fluc- 
tuations, erg). By the same token, the dark matter can be 
probed, for example by examining the difference in the pre- 
dicted profile of galaxy clusters with and without massive 
neutrinos. In order to constrain modified gravity scenarios, 
it will be necessary to combine the data with other measures 
of the gravitating matter, for example through redshift dis- 
tortions. This was done by Wll on cluster scales. On large 
scales, weak lensing and Kaiser (1987) infall have been used 
by Reyes et al. 2010 to make such constraints. 

We note that the calculation of the observability of z g 



effects in large-scale clustering by M09 results in signifi- 
cantly more pessimistic conclusions than ours (M09 esti- 
mates 3 — 5o" detectability from a Euclid-scale experiment). 
However, Wll detected gravitational redshifts at just under 
3<t significance with only of order 10 5 cluster galaxies. This 
indicates that the inclusion of non-linear scale clustering, 
(which we have shown can be modelled well in principle) 
can dramatically increase the chances of making a precise 
measurement. As an example of this, we have tried recom- 
puting the curves shown in Figure [5] but including only the 
linear CDM correlation function term in Equation [4] In this 
case, we find that the curves on scales r > 20 h~ Mpc in 
Figure [5] are similar to the full Equation [4] case, as expected 
(large-scale clustering is not affected). However on smaller 
scales the relative blueshift is reduced by a factor of at least 
20 indicating that non-linear clustering and halo structure 
are both important to the success of z g measurements. 

We have mentioned in Section [T] that several effects 
other than z g can cause asymmetries in measured galaxy cor- 
relation function. With these effects, one can either decide 
that they represent further opportunities to probe physics on 
large scales, or one can decide to model and marginalise over 
them. M09 has shown for example that when using Equa- 
tion[T]the true value of H(z) is not known, only the observed 
redshift of the galaxy. The same is true of the growth param- 
eter determining density fluctuation amplitude. These and 
similar issues lead to terms appearing in the prediction for 
galaxy clustering which have been calculated in the context 
of General Relativity by Yoo et al. (2009,2011). M09 showed 
that these terms appear at the same order in wavenumber k 
as z g terms in the imaginary part of the cross-spectrum. The 
calculation of these terms relies on perturbation theory and 
their amplitude on the k ~ 1 h Mpc -1 scales most relevant 
to the non-linear signal discussed in this paper is likely to 
be small, but should be investigated in future work. 

Related to this, we remind the reader that we have only 
used the plane parallel approximation in our simulations and 
mock catalogues, as well as neglecting evolution across the 
simulation volume. As K13 has explained, geometrical and 
other effects can lead to asymmetries in clustering in red- 
shift space and these terms need to be dealt with. Around 
galaxy clusters, the precise magnitude of geometrical effect 
is related to how galaxies at large distances join the Hubble 
flow. K13 has shown that this can be large at large impact 
parameters and will need to be understood and modelled 
well. For example, when dealing with observational data, 
extremely finely sampled random catalogs should be made 
to ensure that the volume- weighted centroid of shells around 
galaxies is used to compute the statistic plotted in Figure [6] 

Evolution of clustering will lead to asymmetries because 
the overall clustering amplitude on the higher redshift side 
of each galaxy will be lower than on the lower redshift side. 
This will lead to asymmetries in the correlation function. 
These terms are addressed in linear theory by M09 and Yoo 
et al. (2009) and in the context of inflow of galaxies into 
clusters by K13. 

In the virialized regime of clusters, other new terms be- 
come relatively important. For example, Zhao et al. (2012) 
have shown that the Transverse Doppler effect mentioned in 
Section Q] is generically present alongside gravitational red- 
shift because the virial theorem relates the potential to the 
kinetic energy of galaxies. In galaxy cluster models inspired 
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by the data and measurements of Wll, Zhao et al. (2012) 
find that the Transverse Doppler effect (of opposite sign to 
z g ) dominates over z g within the inner ~ 0.2 /i _1 Mpc of 
clusters. At radii r > 3 h~ Mpc, the contribution drops to 
< 5% of z g and so it will be relatively unimportant over 
most of the scales considered in this paper. 

Also important in clusters is the Doppler beaming effect 
(K13) which causes galaxies moving towards the observer to 
have a higher surface brightness and hence luminosity. This 
causes the number of galaxies above a flux limit to vary 
and again causes an asymmetry of galaxy density around 
clusters. K13 has shown in the case of Wll, the flux limit 
lies at the steep end of the galaxy luminosity function, so 
that small changes in flux have large effects on the number 
of galaxies included in the catalogue. The Doppler beaming 
effect has the same sign as z g and K13 find that it is the 
next most important effect after z g for the Wll parameters. 
It also falls off as a fraction of z g at larger radii. 

Apart from general and special relativistic, geometry, 
and clustering evolution effects on the symmetry of cluster- 
ing, one can imagine other uncertainties such as dust ex- 
tinction (less galaxies seen behind a galaxy than in front), 
or lensing magnification (usually causing the opposite effect, 
e.g., Hui et al. 2007). Fang et al. (2011) have examined the 
effect of the former on anisotropies in galaxy clustering and 
find that its effect is largest on large scales and along the 
line of sight. Incomplete knowlege of galaxy formation can 
also lead to uncertainty, as for example in a realistic obser- 
vational case we would need to choose which galaxies to put 
into high and low mass subsamples using their luminosities. 
Also, when comparing with predictions from modified grav- 
ity models, the difference between large scale mass distribu- 
tions inferred from dynamics (peculiar velocity distortions 
on) and from gravitational redshifts will necessitate some 
treatment of how galaxies trace mass. 

Galaxy redshift surveys are not the only datasets which 
could be used to probe the large scale gravitational poten- 
tial with gravitational redshifts. For example, Broadhurst 
et al. 2000 proposed that with high resolution X-ray spec- 
troscopy, intracluster gas could be used to map out z g in 
large galaxy clusters. More relevant to the large-scale struc- 
ture case are 21cm neutral hydrogen surveys (e.g., Peterson 
& Suarez 2012). As M09 has pointed out, the 21cm flux 
field could form a good contrast to the highly clustered field 
probed by bright galaxies and as z g measurements are dif- 
ferential could lead to better measurements. In this case, 
the formalism which we have outlined in this paper should 
make it possible to start modelling the range of scales and 
densities required. 

The current and upcoming generations of large cosmo- 
logical surveys will contain enough data that qualitatively 
new effects such as gravitational redshift distortions to clus- 
tering can be looked for. With only two published detec- 
tions so far, galaxy gravitational redshifts are perhaps two 
decades behind weak lensing as a field. Unlike lensing in the 
early 1990s, however, surveys two orders of magnitude larger 
than those currently analysed are already planned for other 
purposes. These have the promise to bring gravitational red- 
shifts into the precision cosmology era. 
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